On March 17, 2020 President Trump referred to the Coronavirus as the “China Virus.” Shortly after, the number of anti-Chinese incidents started to increase across the United States. One aspect of public health that is often thrown to the wayside is how influential public officials and leaders are in disseminating public health information. Moreover, not only can their words change the public’s views on a health matter but it can also shift a nation’s perspective on someone’s identity. [ADD SOME BACKGROUND LITERATURE THAT BACKS THIS CLAIM UP] In addition, given the influence of identity politics we may expect the term “China Virus” to be more polarizing to certain identities and states. [ADD BACKGROUND LITERATURE + IDENTIFY SUBJECTS OF INTEREST]. There are fewer studies devoted to analyzing these aspects of the pandemic.
Thus,our project aims to explore the relationship and variability of interest in the term “China Virus” across states through political, demographic and COVID-19 characteristics. We ask the question, [INSERT OUR RESERACH QUESTION] ? In order to answer this question, we look to gain a better understanding of [INSERT NAME OF BAYESIAN METHODOLOGY]. Ultimately, we hope our research inspires others to explore and better understand the impact langauge and words have on the public during times of crisis.
Our first visualization is looking at the percent of residents that identify as white within the United States. As you can see, there is a higher percent of white identifying residents in the Midwest and northeast states. From this visualization we can also see that places like Texas, California, and New Mexico have much lower white identifying residents which could provide important information for us in our actual analysis.
As we can see from the density plot of the China Virus Interest during our time period 2020-03-14 - 2020-03-21, it behaves relatively normal with a small bump at 0. As a group we believe this bump occurs as 0 is the lowest value it can take and because of that limitation of the China Virus Interest we see a small bump around 0. One could argue against a normal distribution as it kinda looks a bit right skewed. But our team believes that a normal distribution is the best at describing the density.
When China Virus Interest equals 0 it means that no one in the state looked up the term in Google and this is prevalent at the 14 of March where there are 5 occurrences and the 21 of March where there are 4 occurrences. And in states such as DC, ND, SD, and VT.
This plot shows the relationship of “China Virus” search interest over grouped by region. This plots shows that there are certainly key events that trigger an uptick in searches overall. In this plot it is not clear which region may search China Virus more or less often, but it does show a that the regions move together in search interest, which would imply federal level events like a Donald Trump tweet to trigger these interest spikes.We can see that the variability in Google interest in the term China Virus is has quite a large range between states. There are very few states that have high densities among the upper echelons of the interest scale but there are some interesting peaks of densities among the lower values. For example, we can see that Alaska, Wyoming and Iowa have unusual peaks around the 25-50 range. It is is also interesting interesting to note that there isn’t an obvious mean or median value of China Virus interest among the states.
ggplotly(c)
This visualisation depicts the distribution of positive COVID-19 cases by region and by which political party won in the 2016 elections. We can see that Democrat states in the Midwest, Mountain, and West have a larger range and higher quantile metrics for positive cases overall. For the Northeast and South regions the mean of positive COVID-19 cases are higher but not significantly. This is an interesting pattern considering that poltical party affiliation appears to interact with the number of postive cases by region.